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Modelling of SARS for Hong Kong 
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A simplified susceptible-infected-recovered (SIR) epidemic model and a small-world model are 
applied to analyse the spread and control of Severe Acute Respiratory Syndrome (SARS) for Hong 
Kong in early 2003. From data available in mid April 2003, we predict that SARS would be controlled 
by June and nearly 1700 persons would be infected based on the SIR model. This is consistent with 
the known data. A simple way to evaluate the development and efficacy of control is described and 
shown to provide a useful measure for the future evolution of an epidemic. This may contribute 
to improve strategic response from the government. The evaluation process here is universal and 
therefore applicable to many similar homogeneous epidemic diseases within a fixed population. A 
novel model consisting of map systems involving the Small- World network principle is also described. 
We find that this model reproduces qualitative features of the random disease propagation observed 
in the true data. Unlike traditional deterministic models, scale-free phenomena are observed in the 
epidemic network. The numerical simulations provide theoretical support for current strategies and 
achieve more efficient control of some epidemic diseases, including SARS. 

PACS numbers: 89.75.Hc, 87.23.Ge 



During 2003 SARS killed 916 and infected 8422 
globally _1J. In Hong Kong (HK), one of the most severely 
affected regions, 1755 individuals were infected and 299 
died0. SARS is caused by a coronavirus, which is 
more dangerous and tenacious than the AIDS virus be- 
cause of its strong ability to survive in moist air and 
considerable potential to infect through close personal 
contact 0, 0, H @|- Unlike other well-known epidemic 
diseases, such as AIDS, SARS spreads quickly. Although 
significant, its mortality rate is, fortunately, relatively 
low (approximately ll%)jl|. Researchers have decoded 
the genome of SARS coronavirus and developed prompt 
diagnostic tests and some medicines, a vaccine is still far 
from being developed and widely used0,ll,l3. The dan- 
ger of a recurrence of SARS remains. 

Irrespective of pharmacological research, the epidemi- 
ology study of SARS will help to prevent possible spread- 
ing of similar future contagions. Generally, current epi- 
demiological models are of two types. First, the well- 
known Susceptible-Infected-Recovered (SIR) model pro- 
posed in 1927^3 0|. Second, the concept of Small- 
World (SW) networks^. Arousing a new wave of epi- 
demiological research, the SW model has made some 
progress recently [T^ Im 0|. Our work aims to model 
SARS data for HK. Practical advice for a better con- 
trol are drawn from both the SIR and SW models. In 
particular, a generalized method to evaluate control of 
an epidemic is promoted here based on the SIR model 
with fixed population. Using this method, measuring 
the spread and control of various epidemics among differ- 
ent countries becomes simple. Quick action in the early 
stage is highlighted for both government and individuals 
to prevent rapid propagation. 



I. SUSCEPTIBLE-INFECTED-RECOVERED 
MODEL 

In the SIR model, the fixed population N is divided 
into three distinct groups: Susceptible S', Infected / and 
Removed R. Those at risk of the disease are suscepti- 
ble, those that have it are infected and those that are 
either quarantined, dead, or have acquired immunity are 
removed. The following flow chart shows the basic pro- 
cession of the SIR model jl^] . 
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Here r and a are the infection coefficient and removal 
rate, respectively. A discrete model was adapted by Gani 
from the original SIR model through the coarse-graining 
process and was applied to successfully predict outbreak 
of influenza epidemics in England and Wales |T^ll7|. 
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During the epidemic process, N — Si + li + Ri is fixed. 

Initially, we examine the data for the first 15 days to 
estimate the parameters r and a of SARS for HK. The 
only data are new cases (removal R) announced everyday 
by HK Dept. of Health from March 12, 2003 followed by 
a revised version later 0- To avoid inadvertently using 
future information we do not use the revised data at this 
stage. Population TV = 7.3 x 10*^; since / -f _R < iV it is 
reasonable to let Si = N in right hand side of ||2Jl . /i = 1 
and i?i = is set as the initial condition. li is replaced 
with Ri^i whereas li is uncertain. This assumption im- 
plies the incubation period is only one day, in spite of the 
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fact that the true incubation period of the coronavirus is 
2-7 daysQ- The parameters r and a are scanned for 
the best fit for the stage. For every (r, a), a sequence 
of R'j^j^i is obtained by numerical simulations. An Eu- 
clidean norm of X^i^i (-^i-i-i ~ which indicates a 
distance between the true and simulated data, is applied 
to measure the fit. In Fig. ^ (r, a) of the lowest point 
is the best fit parameters for this stage and this value is 
used for the following prediction. We get r = 2.05 x 10"'' 
and a = 1.444. The method is applied to get parameters 
of a and r in Figs. [5]and|31 




FIG. 1: The quantity ^\'L-^ {Ri+i — R'i+i)^ vs. r and a is 
plotted as dots for the first 15 days SARS data for HK. The 
natural logarithm is applied. 7? and R' are original and sim- 
ulated data, respectively. The thick red curve shows the bot- 
tom of the sharp valley clearly. The lowest point of the valley 
is according to the best fit: r = 2.05 x 10~^ and a = 1.444. 

The prediction is available for the trend based on pa- 
rameters r and a of this stage, the middle day (March 
20, 2003) of the stage is applied as the first day and 

R = X]i=i Ri+i is assumed as /. A curve of squares 
is plotted in Fig. [3 for the first stage prediction. In the 
same way the best fit and prediction is applied for the 
next two stages and plotted in Fig. [5] The best fit is also 
processed weekly for detail discussion later. 

The curves for prediction clearly shows the 3 stages in 
Fig. H The first stage (March 12 - 27, 2003) exhibits dan- 
gerous exponential growth. It shows that the extremely 
infectious SARS coronavirus spread quickly in the pub- 
lic with few protections during this early stage. More 
seriously, it leads to a higher infection peak although in 
this stage the averaged number of new cases R is be- 
low 30. This prediction gets confirmation in the second 
stage (March 28 - April 11, 2003). The peak character- 
ized by the Amoy Garden outbreak comes earlier and 
higher. It indicates appearance of a new transmission 
mode that differs from intimate contact route observed 
in the first stage. We name the new transmission mode 
explosive growth in the contrast to transmission in the 
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FIG. 2: SARS data announced daily by the Hong Kong Dept. 
of Heath, the epidemic curve is plot as column graph. The 
curves of squares (r — 2.05 x 10^^ and a = 1.444), circles 
(r = 1.55 x 10"'^ and a = 1.172) and triangles (r = 1.09 x lO"'^ 
and a = 0.868) are prediction for the 3 stages described in the 
text. Each stage has 15 days. The curve of black dots is for 
revised data. 



first stage which we refer to as burning growth. Out- 
breaks at Prince Wales Hospital (PWH) (where SARS 
patients received treatment) and Amoy Gardens (a high- 
density housing estate in Hong Kong) represent infections 
of burning and explosive modes in the full epidemic, re- 
spectively. The revised data provided by HK Dept. of 
Health in Fig. |2 present a clearer indication for these 
two transmission modes. The outbreak of PWH sent the 
clear message that intimate contacts with SARS patient 
led to infection Detailed investigations of the prop- 
agation at Amoy Garden suggested that faulty of sew- 
erage pipes allowed droplets containing the coronavirus 
to enter neighboring units vertically in the building T^. 
Furthermore, poor ventilation of lifts and rat infestation 
where also suggested as possible modes of contamination 
[23, I21I l2^ |2J|. However, control measures bring the 
spread of the disease under control in the second stage. 
Contrary to the increasing trend in the first stage, the 
prediction curve of the 2nd stage (circle) declines. And 
it anticipates that new cases drop below 10 before the 
60th day (May 10, 2003). Also on April 12, 2003 we 
predicted number of whole infection cases would reach 
1700. Up to April 11, 2003 there were 32 deaths and 
169 recoveries^- We calculated the mortality of SARS 
as the ratio of death to sum of both deathes and recov- 
eries, and it was 15.9%. Therefore we predicted approx- 
imately 270 fatalities in total. In the third stage (April 
12 - 27, 2003), triangle curve in Fig. [21 refines results 
and gives more accurate prediction. This stage predicts 
that new cases per day drops to 5 before the 62th day. 
May 12, 2003. The travel warning for HK was cancelled 
by WHO because HK had kept new cases below 5 for 10 
days since May 15, 2003. And, finally, we predict that 
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the total cases reaches 1730 and nearly 287 deaths (up 
to April 27, 2003 there were 668 recovered and 133, the 
mortality increased to 16.6%). These numbers are very 
close to the true data^]- Precisely the method drawn 
from the SIR model has been verified for prediction for 
full epidemic. However, the accuracy is only possible by 
the first dividing the epidemic into separate " stages" . 

This problem of determining to what extent an epi- 
demic is under control is of greater strategic significance. 
Information on the efficacy of epidemic control will help 
determine whether to apply more control policies or not, 
and balance cost and benefit from them. For each indi- 
vidual the same question will also inform the degree to 
which precautions are taken: i.e. wearing a surgical mask 
to prevent the spread (acquisition) of SARS. Obviously a 
way to estimate control level is required. Actually this is 
a difficult problem because of significant statistical fluc- 
tuations in the data. A quite simple method to evaluate 
control efficiency is discussed below. 

In the SIR model Q, if A/ = J^+i - < a disease 
is regards as being controlled as new cases will decrease. 
This inequality leads to a control criterion for some dis- 
eases in epidemiology research. Applying the approxi- 
mation of S'i = iV we get a threshold Nr < a from jSJ. 
We rescale rN ^ r (r is called infection rate in place of 
infection coefficient now) and then get the threshold that 
is free from population N. 
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This indicates that the removal rate a exceeds the in- 
fection rate r. In Fig. O the circles show the 3 stages 
evaluations with dash line of r = a. And the parameters 
r and a of SARS data for HK are estimated weekly as 
squares also in Fig. 13 The line of r = a is regarded as 
the critical line since number of infected cases increases 
when (r, a) passes through it from below. It is possi- 
ble to apply the diagram of r and a to compare control 
level for different countries and areas even for different 
diseases. This provides organizations like WHO with a 
simple and standard method to supervise infection level 
of any disease. The limitation of the method comes from 
the assumptions of the SIR model. More accurate models 
may provide better estimate of epidemic state and future 
behaviours. 

In summary, a discrete SIR model gives good predic- 
tions for epidemic of SARS for HK. Two distinct methods 
are described for the disease propagation dynamics. Par- 
ticularly the explosive mode is much more hazardous 
than the burning mode. We have introduced a sim- 
ple method to evaluate control levels. The method is 
generic and can be widely applied to various epidemio- 
logical data. 



II. THE SMALL- WORLD MODEL 

Contrary to the long established SIR model, epidemi- 
ology research using Small- World (SW) network mod- 
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FIG. 3: The best fit parameters estimated week by week from 
SARS data of HK are plotted as squares. The circles are for 
3-stage analysis mentioned in Fig. |21 In this panel the line of 
dots r = a is regarded as "alert line" or "critical line". The 
parameters of r and a below it indicate that the epidemic is 
controlled, above the line indicates uncontrolled growth. The 
parameter r is rescaled by rN — > r. 



els is young and growing area. The concept of SW was 
imported from the study of social network into nature 
sciences m 1998'l2j. However, it provides a novel in- 
sight for networks and arouses a lot of ex plor ations in the 
brain, social networks and the Internet. pi. 12^ l27l l2^ . 
Some SW networks also exhibit a scale-free (power law) 
distribution: a node in this network has probability 
P{k) ~ to connect k nodeSj_7 is one basic char- 
acter for the svstempsl |2^ |2^ [28j . Most researches 
of "virus" spreading with the SW model, concern the 
propagation of computer viruses on the Internet, how- 
ever, a few studies have been published relating to 
epidemiologyH^ IH El US 113 . The dynamics of SW 
models enriches our realization of epidemic and possi- 
bly provides better control policies|32l|. From the first 
SARS patient to the last one, an epidemic chain is em- 
bedded on the scale free SW network of social contacts. 
It is of great importance to discover the underlying struc- 
ture of the epidemic network because a successful quar- 
antine of all possible candidates for infection will lead to 
a rapid termination of the epidemic. In HK e-SARS, an 
electronic database to capture on-line and in real time 
clinical and administrative details of all SARS patients, 
provided invaluable quarantine information by tracing 
contacts "5, "33^. Unfortunately a full epidemic network 
of SARS for HK is still unavailable. Because data rep- 
resenting the underlying network structure is currently 
not available, we have no choice other than numerical 
simulations. Therefore, our analysis of the SW model is 
largely theoretical. The only confirmation of our model 
we can offer is that the data appear to be realistic and 
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exhibit the same features as the true epidemic data. 

To simulate an epidemic chain, a simple model of so- 
cial contacts is proposed . The model is established on a 
grid network weaved by m parallel and m vertical lines. 
Every node in the network represents a person. We set 
TO = 2700 with population N = ^ 7.29 x 10^. All 
nodes are initiated with a value of (named good nodes) . 
Every node has 2, 3 or 4 nearest neighbors as short range 
contacts for corner, edge and center, respectively. For ev- 
ery node there are two long range contacts with 2 other 
nodes randomly selected in the whole system everyday. 
These linkages model the social contact between individ- 
uals (i.e. social contacts that are sufficiently intimate to 
bring individuals at risk of spreading the disease). One 
random node of the system is set to 1 (called the had 
node), through its short and long range contacts, the 
value of the nodes linked with it turns into 1 according to 
probability of pi and p2, respectively. An infection hap- 
pens if a node changes its value from to 1. This change 
is irreversible. During the full simulation process, the bad 
node is not removed from the whole system. We make 
this assumption because the number of deaths is small in 
comparision of the population, and there is no absolute 
quarantine — even the SARS patient in hospital can affect 
the medical workers. Moreover, the treatment period for 
SARS is relatively length, and during this time infected 
indviduals are highly infectious. To reflect the true vari- 
ation in control strategy and individual behaviour, the 
control parameters pi_2 (0 < pi_2 < 1) vary with time. 




FIG. 4: A plague is simulated in a geographical map with 
100 X 100 nodes with fixed infection probability pi_2 = 0.005 
which eventually leads to a full infection, a). The full epi- 
demic curve of the simulated epidemic, b). While time is 
45, the infection cases clusteringly scatter in the map. The 
infection happens in full scale map because of random con- 
nections. The clusters show effect from the nearest neighbour 
connections. 

In same model it would be interesting to compare 
with the complete infection epidemic. A small system 
with 100 X 100 nodes is chosen. A fixed probability 
Pi = P2 = 0.05 leads to eventual infection of the entire 



population since all nodes are linked and the infected 
are not removed. The epidemic curve for this process is 
plotted in Fig. 01 a) and is typical of many plagues. In 
Fig. 0] b) various sizes of clusters with infected nodes 
(black dots) scatter over the geographical map. It con- 
tains all infection facts in first 45 days. Comparing to 
the true SARS infection distribution in HK only slight 
similarity is observed. The short and long range linkages 
give good infection dynamics as we expected. The epi- 
demic chain is easily drawn by recording infection fact 
from the seed to last patient. However, during simula- 
tions there is a problem: for a good node linked by more 
than one bad nodes, which bad node infects it? A widely 
accepted preferential attachment of rich-get-richer is a 
good answer A linear preference function is 

applied here. With presence of both growth and prefer- 
ential attachment, it is general to ask whether the chain 
is a scale-invariant SW network. We plot the distribution 
in Fig. |31a) and b) in log- log and log-linear coordinates, 
respectively. The hollow circle is for the system with 
100 X 100 nodes. The scaling behaviour of it looks more 
like a piecewise linear (i.e. bilinear) in b) rather than a 
power law in a) . To confirm this we tried a larger system 
with 1000 X 1000 nodes and the same fixed pi_2 in the 
rest curves in Figs. El The solid square curve is distribu- 
tion of the whole epidemic network. The linear fit for the 
solid square gives a correlation coefficient R of —0.956 in 
Fig. |5la). In b) piecewise linear fits have R of —0.997 
and —0.994. These provide positive support for the orig- 
inal model. The ratio of the cases of first 30% and 1% of 
whole process for this bigger system are plotted as hollow 
triangle and solid circle curves in Figs. |S1 respectively. 
The case of 1%, early stage of full infection epidemic, 
suggests a better fit for scale-free than the other cases 
since it has correlation coefficients R of —0.983 and ra- 
tio 7 ~ —2.89 for linear fit in a). For the solid circle 
curve piecewise linear fits give R of —0.995 and —0.977 
in Fig. Elb). So, what scaling behaviour is true during 
full infection process? With absence of rigorous proof 
this problem is hard to answer correctly in simulations. 
We may only draw conclusions based on which simula- 
tions most closely matches the qualitative features of the 
observed data. 

Let's return to the system with 2700 x 2700 nodes for 
modelling SARS for HK. The probability pi_2 is fitted 
to the true epidemic data. However, it is fruitless to 
obtain an exact coincidence between the simulated re- 
sults and the true data as the model evolution is highly 
random (moreover this would result in overfitting). The 
control parameters pi.2 (dots and dashes curves) and a 
simulated epidemic curve (black dots) with column dia- 
gram of SARS for HK is plotted in Figs. El a) and b), 
respectively. For the simulated data, the total number 
of cases is 1830 that has a 4.3% deviation from the true 
data of 1755. Contrary to the above full infection with 
fixed parameters, pi^2 are believed to drop exponentially 
and lead to small part infection epidemic without quar- 
antine or removal. In any case, the high probability of 
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FIG. 5: The scaling behaviour for a full infection epidemic 
network is plotted in log-log and log-linear coordinates in a) 
and b), respectively. The circle is for a full infection epi- 
demic network simulated in a geographical map with 100 x 100 
nodes. The curves are for one with 1000 x 1000 nodes. The 
curves of dots, triangles and squares are for the epidemic 
networks of 1% , 30% and 100% for full infection process. 
The two dash lines are piecewise linear fits for the full infec- 
tion process with 1000 x 1000 nodes. The ratio is —0.31 and 
-0.07 with R = -0.997 and -0.994, respectively. It clearly 
shows the curves of full epidemic has two piecewise linear 
parts in log-linear graph. In the early stage of a full infection, 
the scaling behaviour of 1% infected (dots) has a part which 
might be regards as a log-log linear. The ratio of it is —2.89 
with R — —0.983. In other words, it has scale-free part with 
7 = -2.89. 



infection in the early stage is indicative of the critical 
ability of the SARS coronavirus to attack an individual 
without protection. If SARS returns, the same high ini- 
tial infection level is likely to occur. The only hope to 
avoid a repeat of the SARS crisis of 2003 is to shorten the 
high infection stage by quick identification, wide protec- 
tion and sufficient quarantining. In other words, the best 
time to eliminate possible epidemic is the moment that 
the first patient surfaces. Any delay may lead to a wors- 
ening crisis. The long range infections in the model and 
the world also imply an efficient mechanism to respond 
rapidly to any infectious disease is required to establish 
global control. 

Data on the geographical distribution of SARS cases in 
HK is much easier to collect than the full epidemic chain. 
Numerical simulations provide both simply. The full ge- 
ographical map marked with all infected nodes (black 
dots) and an amplified window of a cluster is plotted in 
Figs. El a) and b), respectively. Similarity is expected 
and verified in cluster patterns of Figs. Elb). The scaling 
behaviour of the epidemic chain is plotted in Figs. |H1 
The curve in log-log diagram exhibits a power-law coef- 
ficient of 7 = —3.55 and gives the linear fit correlation 
coefficient R = —0.989. The piecewise linear fit for the 
log-linear case gives correlation coefficients R of —0.987 
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FIG. 6: Control parameters and epidemic curve is plotted for 
the model with 2700 x 2700 nodes, a) The short and long 
range linkages infection probability pi,2 generally declines ex- 
ponentially, b) The simulated epidemic curve (black dots) 
in the model is plotted with the original SARS data (grey 
column) for HK. 



2700 

1800 
> 
900 



a) 



b) 



••••••••• 

••••••• •••• 

••••••• 

•••••• • 

•••••••• 



• ••• 
••••• 
• 
• 



125 
120 
-115 

Y 

110 
105 



100 



900 1800 2700250 255 260 265 270 275 

X X 

FIG. 7: The distribution of infected nodes in simulations in 
two dimensional map. a), the whole geographical map; b). 
an amplified window of a cluster marked in circle of a). 



and —0.959, respectively. 

For a SW network often few vertexes play more im- 
portant roles than others[3^. SARS super-spreaders 
found in HK, Singapore and China are consistent with 
this^l^ .36il. Data for the early spreading of SARS in 
Singapore jSg show definite SW structure with a small 
number of nodes with a large number of links. The av- 
erage number of links per node also shows a scale-free 
structure |36j |. but the available data is extremely lim- 
ited (the linear scaling can only be estimated from three 
observations) . 

This character is also verified in our model. The first 
few nodes have a high chance of infecting a large num- 
ber of individuals. In Fig. |S1 a single node has 40 links. 
Clearly the index node has many long range linkages. 
It has been suggested that travelling in crowded public 
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FIG. 8: The scaling behaviour for the simulated epidemic 
network. In log-log coordinates, the scaling P{k) ~ 7 = 
—3.55, where the linear fit exhibits a correlation coefficient 
R = -0.989. 



control an epidemic. Actually, in our model, if duration 
of the early stage with high probability pi^2 is reduced to 
less than 10, the infection scale decreases sharply. 

An engrossing phenomenon is the points of inflection 
in curves in log-linear diagrams of Figs. |H1 and Figs. [S] 
b). All are located near linkages number of 6-7. On 
average a nodes has about 6 contacts (2-4 short plus 2 
long range) every day, although in whole process there 
is no limitation for linkages. For a growing random net- 
work, a general problem always exists among its scaling 
behaviour, preferential attachment and dynamics, even 
embedding a geographical maD|34 1371 ISSL l39l . More 
work is required to address this issue. 

In conclusion a SW epidemic network is simulated to 
model SARS spreading in HK. A comparison of the sim- 
ulations with full infection data is presented. Our discus- 
sion of the infection probability and occurrence of super 
spreaders lead to the obvious conclusion: rapid response 
of an individual and government is a key to eliminating 
an epidemic with limited impact and at minimal cost. 



places (train, hospital, even an elevator) without suitable 
precautions can cause an ordinary SARS patient to infect 
a significant number of others. Again, this is an indica- 
tion that to increase an individual's (especially a proba- 
ble SARS patient's) personal protection is key to rapidly 
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